Declarative sentences:

Length | Sentence |
---|---|
13 | 中信证券:持有北京银行2. |
13 | 周三收盘,道指上涨147. |
13 | 朴槿惠说,相信在“政府3. |
13 | 截止收盘,沪指报2212. |
13 | 截止收盘,沪指报2204. |
13 | 截止收盘,沪指报2183. |
13 | 截止收盘,沪指报2097. |
13 | 截至发稿,沪指报2065. |
13 | 婴儿的饮食原则很简单:1. |
13 | 沪基指报震荡微涨,上涨0. |

Exclamatory sentences:

Length | Sentence |
---|---|
13 | 微齐磊:这句话基本是废话! |
13 | 决战凌霄殿,请叫我第一名! |
27 | 《AaaaaAAaaaAAAaaAAAAaAAAAA! |

Interrogative sentences:

Length | Sentence |
---|---|
13 | 记者:听说公司会罚你钱吗? |
13 | 长得好看获得奖金,行不行? |
13 | 英国消费杂志《Which? |
13 | 泛舟洞庭寻访江豚,你敢吗? |
13 | 陈启宗:你在海外很积极吗? |
13 | 小米手机3使用技巧有哪些? |
15 | 买存储卡送Galaxy S4? |
15 | 1张纸告诉你瓜帅多自信 首发? |

The tables above list the shortest sentences in the corpus, one table each for declarative, exclamatory, and interrogative sentences.
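These sentences give some insight into the language and the corpus. Malformed entries also hint at how the preprocessing could be improved: several declarative entries end right after a digit and a period (for example "截止收盘,沪指报2212."), which suggests that the sentence splitter cut them at a decimal point, and "英国消费杂志《Which?" was cut at a question mark inside a title.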
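
As a minimal sketch of how such malformed entries could be flagged, the following Python snippet marks sentences that end with a digit followed by a period. The pattern and the function name `looks_truncated` are assumptions made for illustration; they are not part of the corpus preprocessing described here, and the heuristic also catches list enumerators such as "1.".

```python
import re

# Assumption: a sentence ending in "<digit>." was probably cut inside a
# decimal number or after a list enumerator, not at a real sentence boundary.
TRUNCATED_AFTER_DIGIT = re.compile(r"\d\.$")


def looks_truncated(sentence: str) -> bool:
    """Return True if the sentence ends with a digit followed by a period."""
    return bool(TRUNCATED_AFTER_DIGIT.search(sentence.strip()))


# Sample sentences taken from the tables above.
samples = [
    "截止收盘,沪指报2212.",
    "周三收盘,道指上涨147.",
    "决战凌霄殿,请叫我第一名!",
]

flagged = [s for s in samples if looks_truncated(s)]
for s in flagged:
    print(s)
```

A check like this only reports candidates. Whether a flagged sentence should be merged with its successor or dropped is a separate decision.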
Only sentences that were accepted by the preprocessing appear here. Language detection usually requires a minimum number of known words, so some very short sentences may be missing from the corpus.
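
The effect of such a threshold can be illustrated with a small Python sketch. The function `accept_sentence`, the word list, and the threshold of three known words are all hypothetical. The actual language detection behind this corpus is not described here, and the input is assumed to be already split into tokens.

```python
def accept_sentence(tokens, known_words, min_known=3):
    """Accept a sentence only if it contains at least `min_known` known words.

    Hypothetical filter: `known_words` stands in for a word list of the
    target language, and `min_known` is an illustrative threshold.
    """
    known = sum(1 for token in tokens if token in known_words)
    return known >= min_known


# Illustrative word list and pre-tokenized sentences (assumed segmentation).
known_words = {"你", "敢", "吗", "很", "积极"}

long_enough = ["你", "敢", "吗", "?"]   # three known words
too_short = ["敢", "吗", "?"]           # only two known words

print(accept_sentence(long_enough, known_words))
print(accept_sentence(too_short, known_words))
```

With a filter of this kind, a sentence can be well-formed and still be rejected simply because it contains too few tokens to reach the threshold.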
SELECT char_length(sentence) AS le, sentence FROM sentences WHERE sentence LIKE '%!' AND length(sentence) < 40 ORDER BY le LIMIT 15;
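
The query sorts by `char_length` but filters on `length`. If the database follows MySQL semantics, where `LENGTH` counts bytes and `CHAR_LENGTH` counts characters, and the text is stored as UTF-8 (both are assumptions, since the source does not name the database), the two measures differ considerably for Chinese text. The sketch below emulates the query in Python under these assumptions. The last sample sentence is made up for illustration.

```python
def char_length(s: str) -> int:
    """Number of characters, like MySQL CHAR_LENGTH (assumed semantics)."""
    return len(s)


def byte_length(s: str) -> int:
    """Number of UTF-8 bytes, like MySQL LENGTH on utf8 text (assumed)."""
    return len(s.encode("utf-8"))


sentences = [
    "《AaaaaAAaaaAAAaaAAAAaAAAAA!",   # mostly ASCII: many characters, few bytes
    "微齐磊:这句话基本是废话!",        # mostly CJK: three bytes per character
    "今天的天气真的非常非常好啊!",      # made-up example, one CJK character longer
]

# Emulates: WHERE sentence LIKE '%!' AND length(sentence) < 40
#           ORDER BY char_length(sentence) LIMIT 15
selected = sorted(
    (s for s in sentences if s.endswith("!") and byte_length(s) < 40),
    key=char_length,
)[:15]

for s in selected:
    print(char_length(s), byte_length(s), s)
```

Under these assumptions the byte limit admits pure Chinese sentences of at most 13 characters, while a mostly ASCII sentence can be about twice as long in characters and still pass the filter.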
4.1.2 Sentences of fixed length I
4.1.3 Sentences of fixed length II
4.1.4 Sentences of fixed length III
4.1.5 Longest sentences